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Abstract. We study the problem of scheduling tasks for execution by a proces- 
sor when the tasks can stochastically generate new tasks. Tasks can be of different 
types, and each type has a fixed, known probability of generating other tasks. We 
present results on the random variable 5"^ modeling the maximal space needed 
by the processor to store the currently active tasks when acting under the sched- 
uler a. We obtain tail bounds for the distribution of S" for both offline and online 
schedulers, and investigate the expected value E[5"^]. 



1 Introduction 

We study the problem of scheduling tasks that can stochastically generate new tasks. 
We assume that the execution of a task r can generate a set of subtasks. Tasks can 
be of different types, and each type has a fixed, known probability of generating new 
subtasks. 

Systems of tasks can be described using a notation similar to that of stochastic 
grammars. For instance 

X^{X,X) X^{X,Y) Y^{X) Y^{Y) 

describes a system with two types of tasks. Tasks of type X can generate 2 tasks of 
type X, one task of each type, or zero tasks with probabilities 0.2, 0.3, and 0.5, respec- 
tively (angular brackets denote multisets). Tasks of type Y can generate one task, of 
type X or Y, with probability 0.7 and 0.3. Tasks are executed by one processor The 
processor repeatedly selects a task from a pool of unprocessed tasks, processes it, and 
puts the generated subtasks (if any) back into the pool. The pool initially contains one 
task of type Xq, and the next task to be processed is selected by a scheduler. 

We study random variables modeling the time and space needed to completely ex- 
ecute a task r, i.e., to empty the pool of unprocessed tasks assuming that initially the 
pool only contains task r. We assume that processing a task takes one time unit, and 
storing it in the pool takes a unit of memory. So the completion time is given by the total 
number of tasks processed, and the completion space by the maximum size reached by 
the pool during the computation. The completion time has been studied in [13], and so 
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the bulk of the paper is devoted to studying the distribution of the completion space for 
different classes of schedulers. 

Our computational model is abstract, but relevant for different scenarios. In the 
context of search problems, a task is a problem instance, and the scheduler is part of 
a branch-and-bound algorithm (see e.g. [19]). In the more general context of multi- 
threaded computations, a task models a thread, which may generate new threads. The 
problem of scheduhng multithreaded computations space-efficiently on mw/fiprocessor 
machines has been extensively studied (see e.g. [22,7,2,1]). These papers assume 
that schedulers know nothing about the program, while we consider the case in which 
stochastic information on the program behaviour is available (obtained from sampling). 

We study the performance of online schedulers that know only the past of the com- 
putation, and compare them with the optimal offline scheduler, which has complete in- 
formation about the future. Intuitively, this scheduler has access to an oracle that knows 
how the stochastic choices will be resolved. The oracle can be replaced by a machine 
that inspects the code of a task and determines which subtasks it will generate (if any). 

We consider task systems with completion probability 1, which can be further di- 
vided into those with finite and infinite expected completion time, often called subcrit- 
ical and critical. Many of our results are related to the probability generating func- 
tions (pgfs) associated to a task system. The functions for the example above are 
fx{x,y) = 0.2x^ + 0.3xy + 0.5 and fY{x,y) = 0.7x + 0.3y, and the reader can 
easily guess the formal definition. The completion probability is the least fixed point of 
the system of pgfs [17]. 

Our first results (Section 3) concern the distribution of the completion space S°p 
of the optimal offline scheduler op on a fixed but arbitrary task system with f{x) as 
pgfs (in vector form). We exhibit a very surprising connection between the probabil- 
ities Pr[S°P = k] and the Newton approximants to the least fixed point of f(x) (the 
approximations to the least fixed point obtained by applying Newton's method for ap- 
proximating a zero of a differentiable function to f{x) — a; = with seed 0). This 
connection allows us to apply recent results on the convergence speed of Newton's 
method [20, 12], leading to tail bounds of 8°^, i.e., bounds on Prl^"^ > k]. We flien 
study (Section 4) the distribution of S'^ for an online scheduler a, and obtain upper and 
lower bounds for the performance of any online scheduler in subcritical systems. These 
bounds suggest a way of assigning weights to task types reflecting how Ukely they are 
to require large space. We study light-first schedulers, in which "light" tasks are chosen 
before "heavy" tasks with larger components, and obtain an improved tail bound. 

So far we have assumed that there are no dependencies between tasks, requiring a 
task to be executed before another. We study in Section 4.3 the case in which a task can 
only terminate after all the tasks it has (recursively) spawned have terminated. These 
are the strict computations studied in [7]. The optimal scheduler in this case is the 
depth-first scheduler, i.e., the one that completely executes the child task before its par- 
ent, resulting in the famiUar stack-based execution. Under this scheduler our tasks are 
equivalent to special classes of recursive state machines [15] and probabilistic push- 
down automata [14]. We determine the exact asymptotic performance of depth-first 
schedulers, hereby making use of recent results [9]. 
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We restrict ourselves to the case in which a task has at most two children, i.e., all 
rules X ^ {Xi, . . . , X„) satisfy n < 2. This case already allows to model the forking- 
mechanism underlying many multithreaded operating systems, e.g. Unix-like systems. 

Related work. Space-efficient scheduling for search problems or multithreaded com- 
putations has been studied in [19,22,7,2,1]. These papers assume that nothing is 
known about the program generating the computations. We study the case in which 
statistical information is available on the probability that computations split or die. 

The theory of branching processes studies stochastic processes modehng popula- 
tions whose members can reproduce or die [17,4]. In computer science terminology, 
all existing work on branching processes assumes that the number of processors is un- 
bounded [3, 8, 21, 23, 25, 27]. We study the 1-processor case, and to our knowledge we 
are the first to do so. 

Structure of the paper The rest of the paper is structured as follows. The prelimi- 
naries in Section 2 formaUze the notions from the introduction and summarize known 
results on which we build. In Section 3 we study the performance ofptimal offline sched- 
ulers. Section 4 is dedicated to online schedulers. First we prove performance bounds 
that hold uniformly for all onhne schedulers, then we prove improved bounds for light- 
first schedulers, and finally we determine the exact asymptotic behaviour of depth-first 
schedulers. In Section 5 we obtain several results on the expected space consumption 
under different schedulers. Section 6 contains some conclusions. FuU proofs can be 
found in the appendix.. 

2 Preliminaries 

Let ^ be a finite set. We regard elements of N"^ and R'^ as vectors and use boldface (like 
u,v)to denote vectors. The vector whose components are all (resp. 1) is denoted by 
(resp. 1). We use angular brackets to denote multisets and often identify multisets over 
A and vectors indexed by A. For instance, if A = {X, Y} and v s N"* with vx = 1 
and vy = 2, then v = {X, Y, Y). We often shorten (a) to a. denotes the multisets 
over A containing at most 2 elements. 

Definition 2.1. A task system is a tuple A = {F, ^ , Prob , Xq) where F is a finite 
set of task types, C x Mp is a set of transition rules, Proh is a function as- 
signing positive probabilities to transition rules so that for every X €: F we have 
"Y^x^a Prob{{X, a)) = 1, and Xq G F is the initial type. 

We write X ^ a whenever X ^ a and Prob{{X,a)) = p. Executions of a task 
system are modeled as family trees, defined as follows. Fix an arbitrary total order ^ 
on F. A family tree Ms a pair {N, L) where N C {0, 1}* is a finite binary tree (i.e. a 
prefix-closed finite set of words over {0, 1}) and L : TV ^ _r is a labelling such that 
every node w G N satisfies one of the following conditions: u; is a leaf and L{w) s, 
or w has a unique child wO, and L{w) satisfies L{w) ^ L{wO), or w has two children 
wO and wl, and L(ii;0), L{wl) satisfy L{w) ^ (L(ii;0), L{wl)) and L{wO) ^ L{wl). 
Given a node w G N, the subtree of t rooted at w, denoted by tyj, is the family tree 
{N', L') such that w' e N' iff ww' e A'' and L'{w') = L{ww') for every w' e N'. If 
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a tree t has a subtree to or ti, we call this subtree a c/ij'W of t. (So, the term child can 
refer to a node or a tree, but there will be no confusion.) 

We define a function Pr which, loosely speaking, assigns to a family tree t = {N, L) 
its probabihty (see the assumption below). Assume that the root of t is labeled hy X.lft 
consists only of the root, and X ^ e, then Pv[t] = p; if the root has only one child (the 

P 

node 0) labeled by Y, and X ^Y, then Pr [t] = p - Pr [to]; if the root has two children 

(the nodes and 1) labeled by F and and X ^ (F, Z), then Pr[t] = p-Pr[to] -Priti]. 
We denote by Tx the set of all family trees whose root is labeled by X, and by Prx the 
restriction of Pr to 73f . We drop the subscript of Ptx if X is understood. 

Example 2.2. Figure 1 shows (a) a task system with F = {X, Y, Z}; and (b) a family 
tree t of the system with probabihty Pv[t] = 0.25 • 0.1 • 0.75 • 0.6 • 0.4 • 0.9. The name 
and label of a node are written close to it. 



£,X 



X^{Y,Z) 



0,F 



00, X m 01, Z • 



• l,Z 

10, y 



(a) (b) 
Fig. 1. (a) A task system, (b) A family tree. 



Assumptions. Throughout the paper we assume that a task system 
A = {r, Prob, Xq) satisfies the following two conditions for every type X € F: 
(1) X is reachable from Xq, meaning that some tree in Txo contains a node labeled 
by X, and (2) Pr[73f] = EteTx = ^- assume that (73f,Prx) is a 

discrete probability space with 73s: as set of elementary events and Prx as probability 
function. This is the formal counterpart to assuming that every task is completed with 
probability 1 . 

Proposition 2.3. It can be decided in polynomial time whether assumptions (1) and (2) 
are satisfied. 

Proof. (1) is trivial. For (2) let the probability generating function (pgf) of the task 
system be defined as the function / : — )• of A where for every X G F 

fx{v)= P-VY-VZ+ P-'Vy+ Yj P- 

X^{Y,Z) X^{Y) X^Hl 

It is well known (see e.g. [17]) that (2) holds iff the least nonnegative fixed point of / 
equals 1, which is decidable in polynomial time [15]. □ 
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Derivations and schedulers. Let t = {N, L) be a family tree. A state of t is a 
maximal subset of A'^ in which no node is a proper prefix of another node (graphically, 
no node is a proper descendant of another node). The elements of a state are called tasks. 
If s is a state and w & s, then the w-successor of s is the uniquely determined state s' 
defined as follows: if w is a leaf of N, then s' = s \ {w}; if w has one child wO, then 
s' = (s\{tu})U{wO}; if w has two children and tol, then s' = (s\{t(;})U{t(;0,t(;l}. 
We write s => s' if s' is the tw-successor of s for some w. A derivation oft is a sequence 
Si . . . =^ ,Sfe of states such that si = {e} and = 0. A scheduler is a mapping a 
that assigns to a family tree t a derivation a{t) of t. If a{t) = (si =>...=> s^), then for 
every 1 < z < A; we denote by a{t) [i] a task of Si such that s^+i is the (T(t) [i]-successor 
of Si. Intuitively, (T(t)[i] is the task of Sj scheduled by a. This definition allows for 
schedulers that know the tree, and so how future tasks will behave. In Section 4 we 
define and study online schedulers which only know the past of the computation. Notice 
that schedulers are deterministic (non-randomized). 

Example 2.4. A scheduler cti may schedule the tree t in Figure 1 as follows: {e} 
{0,1} ^ {0,10} ^ {0} {00,01} {01} ^ {}. Let era be the sched- 
uler which always picks the least unprocessed task w.r.t. the lexicographical order 
on {0,1}*. (This is an example of an online scheduler.) It schedules t as follows: 
{£} ^ {0, 1} ^ {00, 01, 1} ^ {01, 1} ^ {1} ^ {10} ^ {}. 

Time and space. Given X e .T, we define a random variable Tx, the completion 
time of X, that assigns to a tree t G Tx its number of nodes. Assuming that tasks 
are executed for one time unit before its generated subtasks are returned to the pool, 
Tx corresponds to the time required to completely execute X. Our assumption (2) 
guarantees that Tx is finite with probability 1, but its expectation E[Tx] may or may 
not be finite. A task system A is called subcritical if E[Tx] is finite for every X & T. 
Otherwise it is called critical. If A is subcritical, then E[Tx] can be easily computed 
by solving a system of linear equations [13]. The notion of criticality comes from the 
theory of branching processes, see e.g. [17,4]. Here we only recall the following results: 

Proposition 2.5 ([17,15]). Let A be a task system with pgf f. Denote by /'(I) the 
Jacobian matrix of partial derivatives of f evaluated at 1. If A is critical, then the 
spectral radius of f'{l) is equal to 1; otherwise it is strictly less than 1. It can be 
decided in polynomial time whether A is critical. 

A state models a pool of tasks awaiting to be scheduled. We are interested in the 
maximal size of the pool during the execution of a derivation. So we define the ran- 
dom completion space Sx as follows. If a{t) = {si => . . . => Sk), then Sx{t) ■= 
max{|si|, . . . , |sfc|}, where \si\ is the cardinality of Sj. Sometimes we write <S"^(t), 
meaning Sx {t) for the type X labelling the root of t. If we write S'^ without specifying 
the application to any tree, then we mean Sx . 

Example 2.6. For the schedulers of Example°2.4 we have (t) = 2 and S"^ {t) = 3. 
3 Optimal (Offline) Schedulers 

Let 5"'' be the random variable that assigns to a family tree the minimal completion 
space of its derivations. We call S°p {t) the optimal completion space of t. The opti- 
mal scheduler assigns to each tree a derivation with optimal completion space. In the 
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multithreading scenario, it corresponds to a scheduler that can inspect the code of a 
thread and decide whether it will spawn a new thread or not. Note that, although the 
optimal scheduler "knows" how the stochastic choices are resolved, the optimal com- 
pletion space S°P{t) is still a random variable, because it depends on a random tree. 
The following proposition characterizes the optimal completion space of a tree in terms 
of the optimal completion space of its children. 

Proposition 3.1. Let t be a family tree. Then 

f max{5°P(to) + l,S'°P(ti)},l 
mm < ^ , ^ , ^ ^ f ift has two children to, ti 

I max{5°f(to),5°^(ti) + l} J 



5*°^ (to) ift has exactly one child to 

ift has no children. 



[k) 



Proof sketch. The only nontrivial case is when t has two children to and ti. Consider 
the following schedulings for t, where i € {0, 1}: Execute first all tasks of ti and 
then all tasks of ti-u within both ti and ti-i, execute tasks in optimal order. While 
executing ti, the root task of ti-i remains in the pool, and so the completion space is 
s{i) = max{5°'' {ti) + 1, S°P{ti-i)}. The optimal scheduler chooses the value of i that 
minimizes s{i). □ 

Given a type X, we are interested in the probabiUties Pr[S'^^ < A:] for fc > 1. 
Proposition 3 . 1 yields a recurrence relation which at first sight seems difficult to handle. 
However, using results of [11,10] we can exhibit a surprising connection between these 
probabiUties and the pgf /. 

Let n denote the least fixed point of / and recall from the proof of Proposition 2.3 
that = 1. Clearly, 1 is a zero of f{x) — x. It has recently been shown that //, can 
be computed by applying to f{x) — x Newton's method for approximating a zero of a 
differentiable function [15, 20]. More precisely, /i = limfc_,.oo t'^'^^ where 

iv(")=0 and iy('=+i)=i/W + (/-/'(i/W))-i(/(i/W)-iy 

and f'iv'^^^) denotes the Jacobian matrix of partial derivatives of / evaluated at u'^^^ 
and / the identity matrix. Computing /it, however, is in our case uninteresting, because 
we already know that /x = 1. So, why do we need Newton's method? Because the 
sequence of Newton approximants provides exactly the information we are looking for: 

Theorem 3.2. Pr[5^^ <k]= for every type X and every k >0. 

Proof sketch. We illustrate the proof idea on the one-type task system with pgf f{x) = 
px^ + q, where q = 1 — p. Let T<k and T=k denote the sets of trees t with S°P{t) < k 
and S°P{t) = k, respectively. We show Pr[7<fc] = f^''^ for all k by induction on k. 
The case fc = is trivial. Assume that u^^^ = Pr [7<fe] holds for some fc > 0. We prove 

Pr[r<fe+i] = j/C'+i). Notice that 

Let S^'^j be the set of trees that have two children both of which belong to TLfc, and, 
for every i > 0, let B^^P be the set of trees with two children, one belonging to T<k, 
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the other one to By Proposition 3.1 we have T<k+i — Ui>o ^k+i- prove 



Pr 



— fiv'^^^y (/(j^^*^^ — i'*^'^-*) by an (inner) induction on i, which completes 
the proof. For the base i = 0, let A<k be the set of trees with two children in T<k', by 
induction hypothesis we have Pr[^<fc] = pv'^^^v'^^\ In a tree of A<k either (a) both 

children belong to T=k, and so t G ^^k+\' '^^ most one child belongs to T=k- By 
Proposition 3.1, the trees satisfying (b) belong to 7<fc. In fact, a stronger property holds: 
a tree of T<k either satisfies (b) or it has one single node. Since the probability of the tree 

withonenodeisg, wegetPr[^<fe] = Pr sfh +Pr[7<fc] —g. Applying the induction 



hypothesis again we obtain Pr 



jaio) 



■'k+l 



For the induction step, let i > 0. Divide B^'Ij^ into two sets, one containing the trees 
whose left (right) child belongs to B^j^]^-^ (to T<k), and the other the trees whose left 
(right) child belongs X.oT<k (to B^j^]^^). Using both induction hypotheses, we get that the 
probability of each set is pv^^^ f {v(^^y{f{v^^^^>) - v^^^). So Pr[s^!^^^^j = {2pv^^'>) ■ 
_ Since f{x) ^ px^ +q we have /'(i/^*^)) = 2pi/('=), and so 



Pr 



/c?(»+l) 

"k+l 



f>(^j^{k)y+i^f^^{k) _ ^ desired. 



Example 3.3. Consider the task system X ^ {X,X), X M- with pgf f{x) = 
px"^ + q, where p is a parameter and q = 1 — p. The least fixed point of / is 1 if 
p < 1/2 and q/p otherwise. So we consider only the case p < 1/2. The system is 
critical for p = 1/2 and subcritical for p < 1/2. Using Newton approximants we obtain 
the following recurrence relation for the distribution of the optimal scheduler, where 
Pk := Pv[S''P > fc] = 1 - j/C^-i): pk+i = ippl)/{l -2p + 2ppk). In particular, for 
the critical value p = 1/2 wegetpfe = 2^-'= and £[5°'^] = J2k>i Pt:[S°p >k]=2. 

Theorem 3.2 allows to compute the probability mass function of S°p. As a Newton 
iteration requires 0(1^1^) arithmetical operations, we obtain the following corollary, 

where by the unit cost model we refer to the cost in the Blum-Shub-Smale model, in 
which arithmetic operations have cost 1 independently of the size of the operands [6]. 

Corollary 3.4. Pr[S'^'' — k] can be computed in time 0{k-\r\'^) in the unit cost model. 

It is easy to see that Newton's method converges quadratically for subcritical systems 
(see e.g. [24]). For critical systems, it has recently been proved that Newton's method 
stiU converges linearly [20, 12]. These results lead to tail bounds for S^'- 



Corollary 3.5. For any task system A there are real numbers c > and < d < 1 

such that Pr [S°^ > k] < c ■ for all k G 'H. If A is subcritical, then there are real 
numbers c > and < d < 1 such that Pr[5'^^ > /c] < c - d^ for all k €N. 



4 Online Schedulers 

From this section on we concentrate on online schedulers that only know the past of the 
computation. Formally, a scheduler a is online if for every tree t with = (si 
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. . . =^> Sfe) and for every 1 < i < k, the task cr(t)[i] depends only on si =>...=> Si 
and on the restriction of the labelUng function L to Uj=i ^j- 

Compact Task Systems. Any task system can be transformed into a so-called compact 
task system such that for every scheduler of the compact task system we can construct a 
scheduler of the original system with nearly the same properties. A type W is compactii 
there is a rule X ^ (Y. Z) such that X is reachable from W . A task system is compact 
if all its types are compact. From now on we assume that task systems are compact. This 
assumption is essentially without loss of generahty, as we argue in Appendix C.2. 



4.1 Tail Bounds for Online Schedulers 

The following main theorem gives computable lower and upper bounds which hold 
uniformly for all onhne schedulers a. 

Theorem 4.1. Let A be subcritical. 

— Let v,w (1, oo)^ be vectors with f{v) < v and f{w) > w. Denote by v„iin 
and 

WjYiax least component of v and the greatest component of w, respectively. 
Then 

— < Pr fS"^ > A:] < for all online schedulers a. 

- Vectors v,w & {I, oo)^ with f{v) < v and f{w) > w exist and can be computed 
in polynomial time. 

Proof sketch. Choose /i > 1 and li e (0, oo)^ such that /i"^ = vx for all X & T. 
Define for all i > 1 the variable to^'^ = z^'^^-u where "•" denotes the scalar prod- 
uct, i.e., m*^*) measures the number of tasks at time i weighted by types according 
to u. One can show that /i"*'^' , /i™'^' , ... is a supermartingale for any online sched- 
uler cr, and, using the Optional Stopping Theorem [28], that Pr[supjTO(') > .x] < 
{vxo — 1)/ {h^ — 1) for all x (see the appendix for the details and [16, 26] for a sim- 
ilar argument on random walks). As each type has at least weight Umim we have that 
S"^ > fc implies supj m^*) > fctt„ii„. Hence Pr[S"^ > k] < Pr[supjm(*^ > kumin] < 
{vxq — ^)/{Vmin ~ The lower bound is shown similarly. □ 
All online schedulers perform within the bounds of Theorem 4.1. For an applica- 
tion of the upper bound, assume one wants to provide as much space as is necessary to 
guarantee that, say, 99.9% of the executions of a task system can run without needing 
additional memory. This can be accomplished, regardless of the scheduler, by provid- 
ing k space units, where k is chosen such that the upper bound of Theorem 4.1 is at 
most 0.001. 

A comparison of the lower bound with Corollary 3.5 proves for subcritical task 
systems that the asymptotic performance of any online scheduler a is far away from 
that of the optimal offline scheduler: the ratio Pr[5'^ > k] /Pr[S'°^ > k] is unbounded. 

Example 4.2. Consider again the task system with pgf / (x) = px"^ + q. For p < 1/2 the 
pgf has two fixed points, 1 and q/p.in particular, q/p> 1, so q/p can be used to obtain 
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both an upper and a lower bound for online schedulers. Since there is only one type 
of tasks, vectors have only one component, and the maximal and minimal components 
coincide; moreover, in this case the exponent /c + 2 of the lower bound can be improved 
to /e. So the upper and lower bounds coincide, and we get Pr[S"^ > k] = ^g/p-y^-i for 
every online scheduler a. In particular, as one intuitively expects, all onUne schedulers 
are equivalent.* 



4.2 Tail Bounds for Light-First Schedulers 

We present a class of online schedulers for which a sharper upper bound than the one 
given by Theorem 4. 1 can be proved. It may be intuitive that a good heuristic is to pick 
the task with the smallest expected completion time. If we compute a vector v with 
f{v) < V in polynomial time according to the proof of Theorem 4.1, then the type 
Xmin for which = Vmin holds tums out to be the type with smallest expected 

completion time. This suggests choosing the active type X with smallest component 
in V. So we look at t» as a vector of weights, and always choose the lightest active type. 
In fact, for this (intuitively good) scheduler we obtain two different upper bounds. 

Given a vector v with f{v) < v we denote by C a total order on F such that 
whenever X QY then vx < vy- If X QY, then we say that X is lighter than Y. The 
v-light-first scheduler is an online scheduler that, in each step, picks a task of the lightest 
type available in the pool according to v. Theorem 4.3 below strengthens the upper 
bound of Theorem 4. 1 for light-first schedulers. For the second part of Theorem 4.3 we 
use the notion of v -accumulating types. A type X ^ F is, t>-accumulating if for every 
fc > the w-light-first scheduler has a nonzero probabiUty of reaching a state with at 
least k tasks of type X in the pool. 

Theorem 4.3. Let A be subcritical and v e (l,oo)^ with f{v) < v. Let a be a 
v-light-first scheduler Let Vminmax '■= t^^^x^{y,z) TXiax{vY,vz} (here the mini- 
mum is taken over all transition rules with two types on the right hand side). Then 
and for all k > 1 

Pr[5- > fc] < "^^^ 



1 

Moreover, let Vminacc ■= m.iu{vx \ X & F, Xis v -accumulating}. Then 
Vminacc > Vminmax, Vminacc Can be computed in polynomial time, and there is an 
integer I such that for all k > i 



Pr[5<^ > fc] < ^f;_, ' 



■ v"-' - 1 

mm minacc 

Proof sketch. Recall the proof sketch of Theorem 4. 1 where we used that S" > k 
impUes supj rn^'^ > kumin, as each type has at least weight Umin- Let i be such that 
no more than £ tasks of non-accumulating type can be in the pool at the same time. 
Then S"^ > k implies supj m^*^ > £umin + {k — t)Uminacc which leads to the final 
inequality of Theorem 4.3 in a way analogous to the proof sketch of Theorem 4.1. □ 



' For this example PrfS'^ > fc] can also be computed by elementary means. 
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Intuitively, a light-first scheduler "works against" light tasks by picking them as 
soon as possible. In this way it may be able to avoid the accumulation of some light 
types, so it may achieve Vminacc > Vmin- This is illustrated in the following example. 

Example 4.4. Consider the task system with 2 task types and pgfs a; = a2xy + aiy + ao 
and y = b2xy + biy + bo, where 02 + ai + ao = 1 = 62 + &i + &o = 1- The system 
is subcritical if 0162 < 02^1 — 02 + bo- The pgfs have a greatest fixed point v with 
vx = (l-a2-&i-ai62+a2&i)/&2 andvy = (l-&i-&2)/(a2+ai&2-a2&i)- We have 
vx < vy iff 02 — 62 < 0261 — 0162, and so the hght-first scheduler chooses X before Y 
if this condition holds, and Y before X otherwise. We show that the light- first scheduler 
is asymptotically optimal. Assume w.l.o.g. vx < vy- Then X is not accumulating 
(because X-tasks are picked as soon as they are created), and so V-iYiifiacc — Vy • So the 
upper bound for the light-weight scheduler yields a constant C2 such that Pr[5'^ >k]< 
C2/vy- But the general lower bound for arbitrary online schedulers states that there is 
a constant ci such that Pr[S"^ > k] > ci/vy, so we are done. 

4.3 Tail Bounds for Depth-first Schedulers 

Space-efficient scheduling of multithreaded computations has received considerable at- 
tention [22, 7, 2, 1]. The setting of these papers is slightly different from ours, because 
they assume data dependencies among the threads, which may cause a thread to wait for 
a result from another thread. In this sense our setting is similar to that of [19], where, in 
thread terminology, the threads can execute independently. 

These papers focus on depth-first computations, in which if thread A has to wait for 
thread B, then B was spawned by ^ or by a descendant of A. The optimal scheduler 
is the one that, when A spawns B, interrupts the execution of A and continues with B; 
this onUne scheduler produces the familiar stack-based execution [7, 22]. 

We study the performance of this depth-first scheduler. Formally, a depth-first 
scheduler ax is determined by a function A that assigns to each rule r = X ^ 0^^^) 
either YZ or ZY. If A(r) = YZ, then Z models the continuation of the thread X, 
while Y models a new thread for whose termination Z waits. The depth-first scheduler 
ax keeps as an internal data structure a word w € F*, a "stack", such that the Parikh 
image of w is the multiset of the task types in the pool. Ifw = Xw' for some w' & F*, 
then a picks X. If a transition rule X ^ a "fires", then ax replaces Xw' by pw' where 
j3 = X{X a). 

Using techniques of |9] for probabilistic pushdown systems, we obtain the following: 

Theorem 4.5. Let A be subcritical and a be any depth-first scheduler Then 
Pr[5'^ = k] can be computed in time 0{k ■ \F\^) in the unit-cost model. Moreover, 
there is Q < p < 1 such that Pr[S''^ > fc] G 0{p''), i.e, there are c,C > such 
that cp'' < Pr[S"^ > k] < Cp'^ for all k. Furthermore, p is the spectral radius of a 
nonnegative matrix B G , where B can be computed in polynomial time. 

While the proof of Theorem 4.5 does not conceptually require much more than the 
results of [9], the technical details are deUcate. The proof can be found in the appendix. 
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5 Expectations 



In this section we study the expected completion space, i.e., the expectation EIS''^] for 
both offline and onHne schedulers. Fix a task system A = (/", Prob, Xa). 

Optimal (Offline) Schedulers. The results of Section 3 allow to efficiently approxi- 
mate the expectation K[S°p]. Recall that for any random variable R with values in the 
natural numbers we have E[R] — J2iLi > So we can (under-) approximate 
E[R] by X^Li Prfi? > i] for finite k. We say that k terms compute b bits ofE[S"P] if 

EM-E-=:o(i-4!,)<2-^ 

Theorem 5.1. The expectation E[S"'^'] is finite (no matter whether A is critical or sub- 
critical). Moreover, 0{b) terms compute h bits o/E[S'°p]. If the task system A is sub- 
critical, then log2 b + 0(1) terms compute b bits o/E[5'°^]. Finally, computing k terms 
takes time 0{k ■ \r\^) in the unit cost model. 

Online Schedulers. The main result for online schedulers states that the finiteness 
of E[S'^] does not depend on the choice of the online scheduler a. 

Theorem 5.2. If A is subcritical, then E[S'^] is finite for every online scheduler a. If 
A is critical, then E[S"^] is infinite for every online scheduler a. 

Proof sketch. The first assertion follows from Theorem 4.1. Let A be critical. For this 
sketch we focus on the case where Xq is reachable from every type. By Proposition 2.5 
the spectral radius of /'(I) equals 1. Then Perron-Frobenius theory guarantees the 
existence of a vector u with f'{l)u = u and Ux > for all X. Using a martin- 
gale argument, similar to the one of Theorem 4.1, one can show that the sequence 
m^^\m^'^\. . . with m'*' := z^*^ is a martingale for every scheduler a, and, us- 
ing the Optional-Stopping Theorem, that Pr[S''^ > k] > Uxo/(k -\- 2). So we have 

ns"] = EZi p^s- >k]> Er=i ^xj{k + 2) = oo. □ 

Since we can decide in polynomial time whether a system is subcritical or critical, 

we can do the same to decide on the finiteness of the expected completion time. 

Depth-first Schedulers. To approximate E[S"^] for a given depth-first scheduler a, 
we can employ the same technique as for optimal offline schedulers, i.e., we approx- 
imate E[S"^] by Ei=i Pr[S"^ > i] for finite k. We say that k terms compute b bits of 

ElS"] if E[S"^] - PriS"" >i]< 2-''. 

Theorem 5.3 (see Theorem 19 of [9]). Let A be subcritical, and let a be a depth-first 
scheduler Then 0{b) terms compute b bits o/E[S"^], and computing k terms takes time 
0{k ■ \r\^) in the unit cost model. 

6 Conclusions 

We have initiated the study of scheduling tasks that can stochastically generate other 
tasks. We have provided strong results on the performance of both online and offline 
schedulers for the case of one processor and task systems with completion probabil- 
ity 1. It is an open problem how to compute and analyze online schedulers which are 
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optimal in a sense. While we profited from the theory of branching processes, the theory 
considers (in computer science terms) systems with an unbounded number of proces- 
sors, and therefore many questions had not been addressed before or even posed. 

Acknowledgement. We thank the referees for their helpful comments. 
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A Proofs of Section 2 



A.l Proof of Proposition 2.5 



Proposition 2.5 ([17,15]). Let A be a task system with pgf f. Denote by /'(I) the 
Jacobian matrix of partial derivatives of f evaluated at 1. If A is critical, then the 
spectral radius of f'{l) is equal to 1; otherwise it is strictly less than 1. It can be 
decided in polynomial time whether A is critical. 

Proof. One can show (see e.g. [14]) that E[Tx] is the X-component of the least non- 
negative fixed point of f'{l)x + 1, i.e., the X-component of the (componentwise) 
least vector x € [0,oo]^ with x = f'{l)x + 1. This least fixed point is given 
by X]i^o(/'(-'^))*-'-' ^ series that may or may not converge. It is a standard fact (see 
e.g. [18]) that the series converges iff < 1 holds for the spectral radius 

p(/'(l))of/'(l). 

Assume first that A is subcritical. Then the above series must converge, so we have 

< 1 in this case. Now assume that A is critical. Then the above series must 
diverge, so we have > 1. On the other hand, in [12, 15] it is shown that 

< 1. (More precisely, it is shown there that p{f'{y)) < 1 holds for y that are 
strictly less than the least fixed point of /. By continuity of eigenvalues, p{f'{y ) ) < 1 
also holds for the least fixed point of / which is 1 according to the proof of Proposi- 
tion 2.3.) Hence we have p{f'{l)) = 1. 

In order to decide on the criticality, it thus suffices to decide whether the spectral 
radius of /'(I) is > 1. This condition holds iff > x holds for a normegative, 

nonzero vector x (see e.g. Thm. 2.1.11 of [5] and cf. [15]). This can be checked in 
polynomial time with linear programming. □ 



B Proofs of Section 3 



B.l Proof of Proposition 3.1 



Proposition 3.1. Let tbe a family tree. Then 



max{5°P(io) + l,S'°P(ti)}, 
uiax{S'>P{to),S°P{ti) + l} 



1 



ift has two children to, ti 

ift has exactly one child to 
ift has no children. 



Proof. Recall the proof sketch from the main body of the paper. We detail the argu- 
ment why one of the two given scheduUng strategies is optimal, i.e., we argue why the 
scheduler cannot save space by interleaving the schedulings for to and ti. 

Consider an optimal scheduling of t. W.l.o.g. the task to terminates first. Then at 
least one ti-task sticks around during the whole derivation of to. So this scheduUng 
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needs space of at least S°P{to) + 1. Obviously, any scheduling of t needs space of 
at least S°P{ti). So the optimal scheduler needs space of at least max{S°P{to) + 
1, S°P{ti)}. But this lower bound is matched by the scheduUng strategy given in the 
main body of the paper. □ 



B.2 Proof of Theorem 3.2 



Theorem 3.2. Pr[S^ <k]= f'^x every type X and every k >0. 
Proof. Let us inductively define the function £ on trees as follows. 



m ■■- 



ift has no children 

£{to) + 1 ift has one child 

e{to) + 1 if i has two children and S"'P(to) > S''P{ti) 

e{ti) + 1 if i has two children and S'°P (to) < S°P{ti) 

if t has two children and S°P{to) = S°P{ti) . 



With Proposition 3.1, i{t) is the length of a longest path from the root to a descendant 
with the same S^^-value. 

We proceed by induction on k. The base case = is trivial. Let k >0 and let t be 
an X-tree with S^Pit) =k + l.We have to show Pr[S^ =k+l\ = where 



4=0 

We show the following stronger claim: 

Pvis^x^it) = k + i, eit) = i] = (f'iu^'^^f (fii^^"^) - • 

We proceed by an (irmer) induction on i. For the induction base i = we first dispense 
with the case k = 0. We have 

Pr[S'^P(t) = 1, £{t) = 0] = Pr[t has no children] 

because if t has one child then £{t) > 1 and if t has two children, then S'^{t) > 2. 
With the definition of / we obtain 

PT[S°x'{t) = 1, £{t) = 0]=J2p- fxW = fx{^^'^) - ■ 

Now we complete the induction base i = with the case fc > 1. We have 

Pr[S'^^{t) = k + l, £{t) = 0] = Pi[t has two children, 3°" {to) = S°P{ti) = k] 

(1) 
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because if t has one child, then £{t) > 1, and if t has no children, then S°^{t) = 1. 
Further we have by Proposition 3.1 

Pr[5^^(t) < fc] = P- (Pr[^F (*o) < k] Pr[S'/ih) < k] 

X^{Y,Z) 

-Pr[5°^'(io) = fc] Vr[Sl^{H) = fc]) (2) 
+ P-Pr[5°^(io)<fc] 

+ E p- 

Combining these equations we obtain 

Pr[S°/{t) = k + l,m = 0]= Y P-PAS^''{to) = k]Pv[S°/{h) = k] (by(l)) 

X'^{Y,Z) 

= Yl P-Pr[S^''{to)<k]Pr[S"/{h)<k] (by (2)) 

X-^{Y,Z) 

+ Y P-Pr[5°P(io)<fc]+ Y P 

p p 

X^Y X^e 

-Vr[S°^{t) < k] 
= Y^ P'^Y^^^z^ (ind. hyp. on fc) 

X^(Y,Z) 

+ E p--p+j:p 

p p 

X^Y 

(fc) 

= (def.of/) 

For the induction step, let i > 0. Then by Proposition 3.1 and the definition of i 

Pr[5j(<) = fc + l, i{t)=i + l] 
= Y P-{PASY''{to)<k]Pv[S"/{h) = k + l, e{h) = i] 

X'^{Y,Z) 

+ Pr[^7(to) = fc + 1, i{to) = i] Pr[S°/{h) < fc]) 
+ Y P-Pr[^7(*o) = ^ + l> nto)=i] 

P 

X^(Y,Z) 
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+ (/'(i^^'^)' (/(i^*'^) (ind.hyp.onfc,i) 

P 

Yer 

□ 

B. 3 Proof of Corollary 3.5 

Corollary 3.5. For any task system A there are real numbers c > and < < 1 
such that Pr > fc] < c • rf^ for all k € N.IfA is subcritical, then there are real 
numbers c > and < d <1 such that Pr[S^ >k]<c-(f for all € N. 

Proof. By Theorem 3.2 we have Pr[5'°P > k] = l—v^^u — l — So the corollary 
can be understood as a statement on the convergence speed of Newton's method for 
solving X = f{x). The fact that Newton's method started at converges to 1 (the least 
fixed point of /) is shown in [15]. 

For the subcritical case, observe that the matrix / — /'(I) is nonsingular because 
otherwise 1 would be an eigenvalue of /'(I) which would, together with Proposi- 
tion 2.5, contradict the assumption that the task system is subcritical. For nonsingular 
systems, it is a standard fact (see e.g. [24]) that Newton's method converges quadrati- 
cally. As I>v[S''p > fc] < 1 - v^^lr statement follows. 

For the general case (subcritical or critical) Newton's method for solving x = f{x) 
has been extensively studied in [20, 12] and it follows from there that there is a ci G 
(0, cx)) such that 1 — i^^^ < ci • 2^'^/'"^"' where n = \r\, implying the statement. 

C Proofs of Section 4 

C. l A Characterization of Online Schedulers 

For proofs involving online schedulers a, it is convenient to work with a function A^- 
(defined below) which essentially characterizes a. To define it, fix an online scheduler a. 
For every tree t with a{t) = (si s^) and for every j > 0, let z^^^^ (t) denote 

the multiset of types labelling the tasks of Sj if j < (i.e., z^^^ (t) = {L{w) \ w G Sj)), 
and the empty multiset otherwise. One can show that an online scheduler a induces a 
partial function : (N^)* — >■ F defined as follows: Aa{c^^'> . . . c^*)) is defined if there 
is a tree t such that a{t) = (si s^) with k >i and c^^^ = z^^^ {t),. . . , c^*) = 

z^'^(t); in this case ylcr(c^^^ . . . c^'^) = L{a{t)[i]). Intuitively, if A^ gets as input the 
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multisets of types of the states si,. . . ,Si, then it returns the type of the task of Si 
picked up by the scheduler. Let X^^^ = Aa{z^^\ . . . , z^^^), i.e., X^'^ is the type picked 
up at the i-th step. Then X^^^ is randomly replaced by new types according to the 
distribution on the transition rules. More precisely, if r^*) := z^'+^^ + X^'^ — z^'\ then 
PrTrW =a I =Xl =y; p p. 

L I J ^x^a^ 

We will show the following proposition, which allows us to identify an online sched- 
uler (T with the function A^^. 

Proposition C.l. Let (ti, 0-2 be online schedulers. If A^-^ = A^^, then Pr[S"^^ = A;] = 
Pr[5'"2 = k] for all k > 1. 

Lemma C.2. Let a be an online scheduler For every family tree t the first i>l states 
of a{t) are uniquely determined by z'^^\t), z^^\t). In particular, the function Acr 
is well-defined. 

Proof. We proceed by induction on i. The case z = 1 is trivial. Let us consider 

^(^^(t), . . . , z(*+^)(t), and let d = (si =>•••=> Si Si+i) be a prefix of 
the derivation <T{t). By induction, si • • • ^ Sj is completely determined by 
z(^^(t), . . . , z^'^(t). By the definition of online scheduler, cr(t)[i] is completely deter- 
mined by si => • ■ • ^ ,s,; and z'^^^t). . . . , z'-^^{t). Finally, there is a unique transition 
rule L{a{t)\i]) ^ a where a = z^'+^^t) - z'^'\t) + {L{a{t)\i])). But then Si+i is 
also uniquely determined. □ 

Lemma C.3. Let c^^^'> ■ ■ ■ c^^'> e (N^)+ such that for every I < j < i 
the value A„{c'^^^ ■ ■ ■ c'^^^) is defined. Then /\]^^ z'^^'' = c^^^i 

^ aj) where for every 1 < j < i we have 
a- = cW+i) - c(^) + (7l<,(c(i) • ■ • c(^'))). 

Proof. Let us denote by TZ the set of all family trees t such that z^^^{t) = c^^^ for 
1 ^ J < *• By Lemma C.l, there is a derivation d = si ^ ■ ■ ■ ^ Si and a function 
/ : Uj=i ^ r such that for every t = {N, L) G TZwe have that c? is a prefix of a{t) 

and I coincides with I on the subtree -^j- Let us denote by the tree ljj=i Note 
that is a subtree of every tree of TZ rooted in e. Let us denote by I the set of all inner 
nodes of t^. For every w G I, we denote by child{v) {l{va) \ a £ {0, 1}, va G f^) 
the multiset of labels of children of the node u in i'^. Let us denote by £ the set of all 
leaves of t*. It follows directiy from the definition of Pr, that for all f e 7^ we have 

PT[t] = Yl Prob{L{v) ^ child{v)) ■ Pr[t^] 

However, it follows directiy from definitions that for every v £l there is precisely one 
1 < j < i such that cr(f)[j] = v, and then = A^{c'^^'> ■ ■ ■ c^^^) and child{v) = Uj. 
Therefore, 

Pr[t] = Prob{A^{c^^^ ■ ■ ■ c^^^) ^ aj) ■ J| Pi[ty] 
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Finally, 

i-1 



i-1 



(1) ...^0)^ 



teiz 



vec t'eTnv 



Now we can prove Proposition C.l. 

Proof (of Proposition C.l). We denote by z^*-* the variable 2;^*) evaluated with respect 
to a given scheduler A. Let us denote by A^ef the set of all c*^^) 



such that A^^ (c^^) • ■ • c^^)) = yl^., (c^^) ■ • ■ c^^')) is defined for all 1 < j < i - 1, and 
c^') = 0. By Lemma C.3, for every c^^^ ■ • ■ c^*^ e A^e/ we have 



Pr 



Yl Prob{A^,{c^^'> ■ • •c(^)) aj) 



3=1 

i-1 



A 



However, then Pr[S"^i = k] = Pi-[S'^^ = k] because the values of S"^i and S"^ are 



(1) ^(2) 



determined by the values of 2;^/ , z 



and I ^'^2 ) • • ^nd for all family trees t 



we have that a prefix of zi-^ (t) , Zct""/ (t) , ■ ■ ■ and a prefix of zJrV (t) , z)yi (*),••• are in 

Adef- □ 



.(2) 



.(2) 



C.2 Justification for Compactness 

In Section 4 we claimed that we can focus on compact task systems essentially without 
loss of generality. We justify this claim now. 

A non-compact task system can be compacted by iteratively removing all rules with 
non-compact types on the left hand side, and all occurrences of non-compact types on 
the right hand side. 

Proposition C.4. Let us denote by F' the set of all task types removed from A by the 
above compacting procedure and let \r'\ = I. IfX^ e F' , then there is a scheduler a 
such that < £. 

Assume that Xq ^ F'. Let A' be the compacted version of A( i.e., F\F' is the set 
of task types of A'). Every scheduler a' for A' can be transformed into a scheduler a 
for A such that for all k 



Pr 



> k 



< Pr[5'"'^ > fc] < Pr 



^c^',/l' 



>k-e 



(The second superscript ofS indicates the task system on which the scheduler operates.) 
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Notice that computing a from a' is easy: a acts like u' but gives preferences to the 
types that have been (first) ehminated during the compacting procedure. 
Now we prove Proposition C.4. 

Proof. Let A\ be a non-compact task system with a non-compact types Fnon, and let 
Aq be the (possibly non-compact) task system obtained from A-y by removing all rules 
with non-compact types on the left hand side and all occurrences of non-compact types 
on the right hand side of all rules, i.e., Aq is obtained from A\ by performing the 
first iteration of the compacting procedure. Let ctq be a scheduler for Aq. Construct a 
scheduler a\ for A\ as follows: 



The scheduler a\ acts exactly like ctq until one or two rnon-tasks are created 
at which point the completion space of the derivation may be increased by 
at most 1. Then a\ picks a /^„on -task, say n. Since the -Tnon-types are non- 
compact, a\ can complete t\ without further increasing the completion space. 
After Ti has been finished, there may be another r„o„-task left, say T2, that was 
created at the time when t\ was created. If there is such a ti, then ai completes 
T-2 in the same way it has completed t\. After t\ (and possibly T2) have been 
completed, a\ resumes to act like ctq. 

It follows from this construction that the incorporation of the non-compact type -r„o„ 
increases the completion space of a derivation by at most 1. 

A straightforward induction on this construction shows for the statement of the 
proposition: 

Pr [5^''^' < fcl < Pr [55-'^ < fc ^1 for all X e r \ r'. 



If g r', then the above construction also works. (It extends a scheduler op- 
erating on a possibly empty task system, but this poses no problems.) So, again by 
induction, we obtain a scheduler a for A with < I for all X £ F' . 



It remains to show the inequality Pr 



Si'"^' > k 



< Pr 



S 



X 



> k 



, but this is 



clear because A' is obtained from deleting rules and types from A and a is obtained by 
extending a'. □ 



C.3 Proof of Theorem 4.1 

We split the proof in several lemmata. With regard to the computation of a suitable 
vector V we first prove the following lemma. 

Lemma C.5. Let u e [1, 00)^ denote the vector of expected completion times, i.e., 
uy = E,[TY]forallY € F. Then u exists and is the unique solution of x = f'{l)x+l. 
Let Q{u,u) denote the "quadratic part" of f{u), i.e., {Q{u,u))^ ='Y^ p p-uy 

Uz for all X,Y, Z € F. Let s := 1/qmax > where Qmax is the largest component 
ofQ{u, u). Then for allr >0 we have /(I -|- ru) <l + ru iffr < s. 



20 



Using this lemma a suitable v can be found as follows: First compute u by solving 
X = /'(l)a; + 1. This yields Q{u, u), and, consequently, s. With regard to the upper 
bound of the theorem we are interested in a w which is as large as possible, so pick 
V := 1 + su. All steps can be performed in polynomial time. 

Proof of the lemma. The fact that u = + 1 exists and is the vector of ex- 

pected completion times follows from the remarks made at the beginning of the proof 
of Proposition 2.5. Recall that the pgf / is a vector of polynomials of degree 2 with 
positive coefficients. So it can be written as 

f(x) = Q{x, x) + Lx + c 

where Q{x, x) is the quadratic part of f{x). A straightforward calculation shows for 
all r e M and a; e 

/(I + rx) = /(I) + rf'{l)x + r^Q{x, x) (Taylor expansion) 

= l + rf'{l)x + r^Q{x,x) (as /(I) = 1) . 

For u = f'{l)u + 1 it follows 

/(I + ru) = l + r{u - 1) + r'^Qiu, u) , 

so we have /(I + ru) < 1 + ru iff rQ{u, u) < 1. The statement follows. □ 
Next we show how a suitable w can be found. 

Lemma C.6. One can compute in polynomial time a vector w G (l,oo)^ with 
f{w) > w. 

Proof. Using the Taylor expansion of /(I + rx) as in the previous lenmia, we obtain 
f{l+rx) > 1 + rxiff 

rQ{x,x)>{I - f'{l))x. (3) 

We will choose w := 1 + rx, so we need to find suitable r and x such that (3) holds. 
Define y S {0, 1}^ such that = 1 if the X-component of Q{x, x) is not constant 
zero (or, equivalently, if there is a rule X {Y, Z) for some Y,Z e F). Otherwise, 
i.e., if fxi^) has degree 1, set i/x = 0. Define x := f'{l)*y = (/ — f'{l))~^y. 
By the compactness of the task system, all types can reach a type X with yx = 1. It 
follows that / (l)*y is positive in all components. Hence, Xmin > where 

*^mm is the 

smallest component of x. 

Observe that (/ - / (l))a; = y, so (3) holds at least for the components X with 
yx = 0. Let c denote the smallest nonzero coefficient of /. Equation (3) holds also for 
the components X with yx = 1 if we set r > l/(c • Xmin)- The statement follows. □ 

To complete the proof of Theorem 4.1 it remains to show the claimed bounds 

onPr[S"^ > k\. 

Theorem 4.1. Let A be subcritical. 
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- Let v,w € (1, cxd)^ be vectors with f{v) < v and f{w) > w. Denote by Vmm 
and Wmax the least component of v and the greatest component of w, respectively. 
Then 

— — < PrfS"^ > k] < -^^^ — for all online schedulers a. 

"J max -L ^mm 

- Vectorsv,w€ (l,oo)'" with f{v) <vandf{w) > w exist and can be computed 
in polynomial time. 

Proof. The second assertion follows from Lemmas C.5 and C.6. It remains to show the 
first assertion. 

Let /i > 1 and u G (0, oo)^ such that /i"^ = for aU F e T. Define m^*) := 
• ti where "•" denotes the scalar product. Not that m^^' = uxo ■ 
Let us consider i > 1. Let y = c*^^) , • ' " i c*^*) be a sequence of elements of with 
c^*) ^ 0, and let Ty be the set of aU family trees t satisfying z*^-^^ (t) = c^^' for every 
1 < J < Note that m^*) (t) ^ 0. Observe that m^'^ is constant over Ty, we denote by 
rn^*) (Ty) its value over Ty. 

An easy computation reveals that for F := A„ (y) we have 



E 




Ty 


= E 






= E 


n «?' 


Ty 










.zer 











(4) 



as /(v) < Consequentiy, we have 



E 



==E 
= E 

= E 



/i^*''*" I T 



= /i"^'H^«) -E 



•E 
,(*). 



■E 



/J-(^^(!/)>■^ 



(def.ofm('+^^) 
(def.ofr*'') 

\const. on Ty 
(def.ofmW). 

(Equation (4)) 



As this is true for all onUne schedulers a and also E [m(*+^) | m^*) = O] = we have 



E 



i.e., the sequence /i™*^' , /i™*^' , ... is a supermartingale. 

Define the stopping time Tk := inf{z > 1 | m^'^ G {0} U [fc, oo)}. Note that 
m('^'=) < fc + 2MTOa2:, and hence that m(^'=) e {0} U [k,k + 2u„mx]- We wish to 
apply Doob's Optional-Stopping Theorem [28] (sometimes called Optional-Sampling 



Theorem) to infer that E 



< E 



= vxn ■ To this end we define the 



sequence m^^^ , m^^' , • • • by setting m^'^ := m^'^ for i <Tk and m^'^ := m^'^''^ for i > 
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Tfc. The sequence /i™'^' , /i'"'^' , ... is a martingale as /i™*^' , /i™*^' , ... is a martingale. 



To apply the Optional-Stopping Theorem we also need to make sure that 



is bounded by a constant, which is the case as m*^*^ G [0, fc + 2umax] for all i. 
Define the stopping time := inf{« > 1 | m^*' G {0} U [k, oo)}. Doob's Optional- 
Stopping Theorem now yields 



E 





= E 




< E 




= E 



















= = VXo ■ 



Let, as an abbreviation, pk := Pr [m^'^'') > A;] . Then we have 



which gives 



> /l° ■ (1 - Pfe) -h /l'^ • Pfe = 1 - Pfe /l'^ ■ Pfc 



Letting \z^^^\ denote the sum of the components of z^^\ and Umin the smallest compo- 
nent of u, we have 



PrlS" > /c] =Pr 



sup 



< Pr 



sup 



Pkv 



< 



VXq - 1 

(5) 

So we have shown the upper bound. 

For the lower bound we redefine h and u such that h"^^ = wy for all F e P which 
allows to show in an analogous way that 



E 



> h' 



i.e., the sequence ft.™*^' , /i™*^' , ... is now a submartingale. The Optional-Stopping The- 



orem now yields E 



which gives 



> wxo- Further we now have 



■Pk 



Pk > 



WXq - 1 



and thus 

PiiS" > fc] = Pr 



sup|zW| > k 



> Pr 



supm^*^ > kUr, 



WXq - 1 

■ PkUma. ^ k+2 _ 



23 



C.4 Proof of Theorem 4.3 



We first prove the following proposition. 

Proposition C.7. The set of v -accumulating types can be computed in polynomial time. 

Proof. We start with some notations. By =>* we denote the reflexive and transitive 
closure of We use "+" for multiset union. We say that X can generate a multiset 
a, denoted by X =^ a, if some multiset containing a can be derived from X , i.e., if 
X =>* a + ^ for some multiset j3. We write Y aifY can generate a using only 
X-bounded rules, i.e., rules Z ^ B such that Z < X, and Y ^=>if a to denote that the 
light-first scheduler can generate a. Finally, we denote by a-^ (a^^) the restriction 
of a to types Y > X (Y > X). 

We prove the following characterization: X is t)-accumulating iff there is Y such 
that Xo ==> Y and Y =^y X + Y. This immediately leads to a polynomial algorithm. 

(^): Assume X is v-accumulating. Then Xo =^;/ n ■ X holds for infinitely many 
n > 1. We claim that there exists a type W such that W =^x n ■ X for infinitely many 
n > 1. For the claim, take the longest suffixes of the witnesses for Xq ==^if n ■ X 
that only use rules X-bounded rules, and let a„ be their corresponding initial multisets. 
These suffixes are then witnesses for a„ =>x n • X. By the maximality of the suffixes, 
either a„ = Xq holds for infinitely many n > 1, or a„ = does. In the first case, 
we take W := Xq. In the second case, let Z„ ^ /3„ be the rule applied to obtain a„. 
Then 

Xq (an - Bn) + Zn ^If (an - Bn) + Pn =^X n ■ X 

where X < Z„. Since the step (a„ — + Z^ ^if — Pn) + is hght-first and 
X < Zn, we have (a„ — /3„) = (q;„ — I3n)^^ , and so there are infinitely many n > 1 
such that (3n =^x n- X. Since < 2 for all n, the type W exists, and the claim is 
proved. 

Consider now a witness of W =^x n ■ X for some n > 2*^ + 1, where k is the 
number of types. The corresponding tree has depth at least fc + 1, and so it contains a 
path in which some type Y appears twice. This easily leads to Y =^x X + Y for some 
type Y such that Xq =^ Y. 

{<=): We start with some simple properties of the relations =^x ^tf- 

(1) If y ==^x " tind a = a--^ , then Y =^ij a. 

Consider a family tree having a (prefix of a) derivation that witnesses Y =^x ol- 
So all ancestors of the nodes corresponding to a are labeled by symbols that are 
< X. It follows that a light-first scheduler may select aU ancestors of the a-nodes 
before selecting any a-node. Hence Y =^if a. 

(2) If X =^ y and y =^j/ /3, then X p. 

X ^Y implies X =^*^ y + a for some q:, and y =^;y ft implies Y jft + fti 
for some /?! . As X =^*if Y -\-a., it suffices to find a derivation witnessing y + a =^>^ 
that reaches a multiset of the form ^ -|- 7 for some 7. Such a derivation is obtained 
by interleaving the witnesses for Y P + Pi =>*j and a =>y: 0. 

Assume now that Xq ^ y and Y =^x X + Y hold. Then Y ^x n • X for 
every n > 1. Now (1) yields Y =^if n ■ X, and (2) leads to Xq =^;/ n ■ X, also for 
every n > 1. So X is t;-accumulating. □ 
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Now we complete the proof of Theorem 4.3. 



Theorem 4.3. Let Abe subcritical andv G {l,oo)'^ with f{v) < v.Letcrbeav-light- 
first scheduler. Let 

Vfninmax ' — miiij^<^^y^2) ^^ix^vy , 'U^]- (here the iTiinimufn is 
taken overall transition rules with two types on the right hand side). Then Vr, 
Vmin and for all k >1 

vx„ - 1 



P^S" >k]< 



Moreover, let Vminacc '■= ra.m{yx \ X & F, Xis v-accumulating}. Then 
Vminacc > Vminmax, Vminacc Can be Computed in polynomial time, and there is an 
integer t such that for all k> £ 



PrlS" >k]< 



- v''-^ - 1 

mm minacc 



Proof. The inequahty Vminmax > Vmin is trivial. For the inequality Vr 
Vminmax, let Li := {Y & T \ vy < Vminmax} be the Set of types that are strictly 
lighter than v„iimyiax- We claim that, in each step i, there is at most one task of ii-type. 
More formally, if e*^^'^ denotes the vector with e^*' = 1 for y G Li and e^*' = 
for Y ^ Li, then we have z(') 'e^^'^ < 1 for all i. This can be shown by a straight- 
forward induction on the derivation length: at each step the task of Li-type (if present) 
is selected and replaced by at most two tasks. By definition of Vminmax, at most one 
of the new tasks has ii-type. Hence, the types in Li are not accumulating. It follows 

Vminacc ^ Vminrnax- 

The rest of the proof is obtained by a small modification of the proof of The- 
orem 4.1: it suffices to show that, in Equation (5), we can replace kumin by 
Umin + {k- l)uminmax and by lUmin + (k - e)u„,inacc for somc integer £. (The 
values Uminmax and Uminacc are defined in the obvious way, i.e., using the h from 
the proof of Theorem 4.1 we have /i"™f"na» = Vminmax and /I'^^f^cc = Vminacc-) 
So we need to show for the light-first scheduler a that l^;*^*^! > k implies both 

TTl^ ^ > Umin ~t~ ^)'^minmax and TTl^ ^ ^ £'^min ~t~ £^'^minacc' 

For the first implication, recall that m^'^ = z^'' 'u. We have argued above that 
^(0 .e(i^) < 1. This implies m'^^ > Umin + {k - l)ur. 



''mmmax- 



For the second implication, let £' be an integer such that < I' for aU i and for 
all non-accumulating types Y. Let £ := \r\ ■ £'. Then in each step, there are at most £ 
tasks of non-accumulating type. This implies m^'^ > £umin + [k — £)uminacc- □ 



C.5 Proof of Theorem 4.5 

In the following we let M* := I + M + MM H for any square matrix M. If M* 

converges, then, by basic matrix facts, it equals (/ — M)^^. Also by basic matrix facts 
(see e.g. [18]), M* converges iff the spectral radius of M is less than one. 

Define for all vectors u. v the vectors L{u) and v) such that for all X G F 

L{u)x ^ puy and Q{u,v)x ^ puyuz ■ 

p V 

X^Y X^YZ 
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Note that the sums extend over the rules after applying A. Also note that L is a linear 
vector function and we view it as a matrix whose rows and columns are indexed with F. 
Furthermore, we write Q{-,v) andQ{u, •) for the matrices with Q{-, v)u = Q{u, v) = 
Q{u,-)v. 

Here is a restatement of Theorem 4.5: 

Theorem 4.5. Let A be subcritical and a be any depth-first scheduler. Then 
Pr[S"^ = k] can be computed in time 0{k ■ \r\^) in the unit-cost model. Moreover, 
there is Q < p < 1 such that Pr[S"^ >k]€ 0{p''), i.e, there are c, C > such 
that cp^ < Pr[S'^ > k] < Cp^ for all k. Furthermore, p is the spectral radius of a 
nonnegative matrix B G M^^^, where B can be computed in polynomial time. 

We first prove the first part of Theorem 4.5 . In fact, the following proposition allows 
to compute Pr[S'^ > A;] for aU X e -T at the same time. We define, for all fc > 1, the 
vector s[k] G [0, 1]^ such that s[k]x = Pr [^J- > k] for all X. 

Proposition C.8. Let A[k] := L -\- Q{1 — s[k], ■). Then {I — A[k])~^ exists and for all 
fc > 1 

s[k + 1] = A[k]s[k + 1] + Q{; l)s[k] = {I- A[k])-^Q{; l)s[k] . 
Proof. The following equation follows from the definition of a depth-first scheduler a. 

Py:[S^ > fc + 1] = Yl PPi"[5f > fc + 1] 

p. 

+ J2 p(Pr[S'r >fc] + Pr[5^ <fc]-Pr[55 >fc + l]) 

p 

X^YZ 

Using the definitions this immediately impUes the equaUty 

s[k + 1] = A[k]s[k + 1] + Q{; \)s[k] . 

For the second equaUty of the proposition, note that /'(I) = L -\- Q{\,-) -\- Q{-,1). 
As the task system is subcritical, the spectral radius of /'(I) is, by Proposition 2.5, less 
than one. So the spectral radius of A[k] < L-\- (5(1, •) < /'(I) is less than one as well. 
Hence, by standard matrix facts [18] the sum A[k]* converges and equals {I — A[k\)~^. 
The second equaUty follows. □ 

Notice that Proposition C.8 in fact implies the first statement of Theorem 4.5, because 
Pr[S"^ = fc] = s[fc]j!i:o — s[k — l]xo and a matrix can be inverted in time Od-Tl^) in the 
unit-cost model. 

For the rest of the proof of Theorem 4.5 we need the following two auxiUary lem- 
mata. 

Lemma C.9. Let Abe a nonnegative square matrix with spectral radius less than one. 
Let (e„)„£N be a sequence with e„ > e„+i > converging to 0. Then there exists an 
n\ and a nonnegative matrix K such that for all n>n\ 

{{l-en)A)* >{I-enK)A* . 
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Proof. We can assume e„ < 1. Let M = [I — A) ^A. Then by a simple computation 

((l-e„)^)* = (7 + e„M)->. 

Choose ni large enough so that p{enM) < 1. Then (e„M)* exists and so 

(7 + e„M) = 7 - (e„M) + (e„M)2 - (e„M)3 + - • • • 
>7-(e„M)(e„M)* 
>7-e„M(e„,Mr 

Choose = M(e„i M)* and the claim follows. □ 

Lemma C.IO. Let B := {I — L — <5(1, ■))~^Q{-, 1). T/ien the spectral radius ofB is 
less than 1. 

Proof. Observe that /'(I) = L + Q{1, •) + Q{-, 1). As {A,X) is subcritical, Propo- 
sition 2.5 implies that the spectral radius of /'(I) is less than one. Then it follows that 
the spectral radius of B is less than one as well, using the theory of M-matrices and 
regular splittings, see [5], Theorem 6.2.3 part P48. □ 

To complete the proof of Theorem 4.5 it suffices to show the following proposition. 

Proposition C.ll. Let A be subcritical and a be any depth-first scheduler Let B := 
(L + Q(l, •))* (3(-, 1) and p the spectral radius of B. Then {] < p < 1 and 
Pr[5'^ > fc] e e{p^), i.e, there are c,C > such that cp^ < Pr[S"^ > k] < Cp^ 
for all k. 

Proof. Wehavep < 1 by Lemma C.IO. To show p > 0, it suffices (by Perron-Frobenius 
theory [5]) to show that all row sums of B are (strictly) positive. For this, letY G F be 
the index of an arbitrary row. Then, by compactness of the task system, there are types 

Xo,...,X^(0 < i < n-1) such that Y = X^ and X^ A ....Xi A Xq 

Po 

and Xq ' — > ZW for some Z,W ^ F. It is straightforward to show by induction on i 
that the {¥, Z)-entry of 7,'Q(-, 1) is positive. It follows that the {¥, Z)-entry of B is 
positive, so p > 1. 

For the upper bound, observe that with Proposition C.8 we have 

s[k +1] = {L + Q{1 - s[k], ■))* Q{; l)s[k] < Bs[k] . (6) 

By a simple induction it follows s[k + i] < B^s[k]. As the absolute values of the 
eigenvalues of B are bounded by p we get ||s[fc + i] || < Cip* for some Ci > 0, which 
impUes the claimed upper bound. 

For the lower bound, observe that there is a real number < r < 1 such that for all 
types Y G F, the probability that X reaches Y is at least r. So it suffices to find any 
y e r such that there is a ci > with Pt[S^ >k]> c^p^ for all k. 

Recall that p is the spectral radius of B. It is a corollary (Corollary 2.1.6 of [5]) of 
Perron-Frobenius theory that B has a principal submatrix B' which is irreducible and 
also has spectral radius p. We write F^ for the subset of F such that B' is obtained 
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from B by deleting all rows and colunms that are not indexed by F-f. Also by Perron- 
Frobenius theory, B' has an eigenvector u' £ (0, oo)^^ with B'u' = pu' so that u' is 
positive in all components. Define u e [0, oo)^ as the vector with uy = u'y > for 
Y e F-f and ity = for F ^ F-^. Hence we have Bu > pu. By the already proven 
upper bound there is a f > such that s[k] < tp'^ for all k. We abbreviate Cfc := tp'^ so 
that s[k] < efel. 

Now we show that there is a natural number k and a real number d > with Cfcrf < 1 
such that for all i > 

s[k + i]> // ^n(l - ek+j-id)j u . (7) 

As Uy = for F ^ it suffices to show s[k + i] >^ (nj=i(l ~ ^k+j-idfj u 
where by the notation v >^ w we mean vy > wy for all Y G F^. We proceed by 
induction on i and determine the constants on the fly. For the induction base (i = 0) 
observe that, as s[k] is positive by compactness of the task system, we can enforce 
s[k] > u by scaling down u by multiplying it with a small constant. This does not 
affect the stated properties of u. For the step, let i > 0. We have 

s[k + i + l] = {L + Q{1 - s[k + i], •))* + i] (by (6)) 

> ((1 - ek+i){L + Q(l, •)))* 0(-, l)s[fc + i] (as s[k + i]< Ck+iV, 



> ((1 - €k+i){L + Q{1, •)))* Qi; l)p' - Efc+j-id) j u (ind. hypofliesis) 

(i \ /for a large k and\ 

- Ck+j-id) I u I some matrix K by j 

3=1 J \LemmaC.9 / 

> ^11(1 ~ efe+j-i'^)^ (P'" - Ck+iKBu) (as > pu) 
>T n (1 - Efe+j-i^^) ) (Pw - ek+ipdu) ( J'g J Ij^S^^^^^ '^'*) 



This proves (7). So, denoting by Umin > the smallest nonzero component of u, we 
have 



s[A; + i]y > p^ ~ ^k+j-id) j Wmm for aU Y G F-^ and alH > 0. 
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Thus the proof is completed if 11^^(1 ~ ^j^) > 0. To see that this inequahty holds, 
observe that 1—ejrf = l—tp>d> 1 — is true for almost all j and that n^2(l^p') ~ 
I > 0. This completes the proof. □ 



D Proofs of Section 5 
D.l Proof of Theorem 5.1 



Theorem 5.1. The expectation E[6'°p] is finite (no matter whether A is critical or 

subcritical). Moreover, 0{b) terms compute b hits o/E[S'°p]. If the task system A is 
subcritical, then log2 b + C(l) terms compute b bits of¥\S°P]. Finally, computing k 
terms takes time 0{k ■ \r\^) inthe unit cost model. 

Proof. Note that the second statement implies the first one. Let e^*^ := 1 — i/^j^. Then 
we have EfS"?] - Eto (1 - ^x^) = T,Zk follows from [12] that there is a 

ci e (0, oc) such that for all i G N we have e^'^ < ci ■ 2~*/("^") where n = ITj. Using 
this inequality we get 

oo oo 

J2 e^'^ < ci 1^ 2-^/("2") < C2 . 2-'=/("2") 

i=k i=k 

withc2 = ci/(l-2-i/("2")) Choosing fc = [(ft+logs C2)n2"] we obtain X),^/; eW < 
which proves the second statement. 

For the third statement (about subcritical systems) recall from Corollary 3.5 that 
there arec>OandO<d<l such that e^*) < c • d^' for all i e N. So 

CXD OO OO 

T^e^^<T.c.d-<c.Y^d^'-^ = ^^.d^\ 

i=k i=k i=0 

By choosing a natural number k with k > — logj (— log2 d) + log2 6 + 1 we obtain for 
all b > log that ■ d^ < 2^^ which proves the third statement. 

The final statement follows from Corollary 3.4. □ 



D.2 Proof of Theorem 5.2 



Theorem 5.2. If A is subcritical, then E[S'°'] is finite for every online scheduler a. If 
A is critical, then E[S'^] is infinite for every online scheduler a. 

Proof. Let A be subcritical. By Theorem 4.1 we have for every online scheduler a 

oo oo 

ns'^] = j2Pr[s^>k]<j2 < 00 , 

fe=l k=l ™*" 
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because it is a geometric series. 

Let now A be critical. The proof follows the lines of the proof of Theorem 4.1. By 
Proposition 2.5 we have p{f'{l)) = 1 for the spectral radius of /'(I). 

Let us fix an online scheduler a. First we prove E[5"^] = oo for the case in which Xq 
is reachable from every type X G F. Later we will show how to drop this assumption. 
If Xq is reachable from every X, it follows that /'(I) is an irreducible matrix. Then 
Perron-Frobenius theory [5] guarantees the existence of an eigenvector u G of 
/'(I) which is positive in all components, i.e., /'(l)tt = uandux > for all X G F. 
W.l.o.g. we can choose It such that its largest component is 1. Let again m^*) := z*^*^ 'U. 
Note that m^^' = uxo > and m^'^ < Iz'^^l where \z^^^ \ denotes the sum of the 
components of z^^\ Also note that m^*^ returns a weighted sum of the components 
of z^^\ Loosely speaking, we wiU show that its expectation remains constant. 

Let us consider i>l. Let y = c^^^ , • ' ■ > c^*^ be a sequence of elements of with 
c*^*) ^ 0, and let Ty be the set of all family trees t satisfying z^^\t) = c^^^ for every 
1 < j < *• Note that m^*) {t) ^ 0. Observe that m^'^ is constant over Ty, we denote by 
m^^{Ty) its value over Ty. 

An easy computation reveals that for every X £ Fwe have 



E 



r'^ I Ty 



which gives 



E 



T. 



(8) 



(where /^^ (y) (1) denotes the row vector indexed by A„ (y)). Consequently, we have: 



E 



m 



(i+i) I J. 



E 



^(i+i) I 

Z« I Ty 



•u 



-E 



-E 



I Ty 



= m^^\Ty) + f^^^y^{l)u-{A,{y))'U 
= m^'\Ty) 

Also clearly E [m('+^' | m^'^ = O] =0, and hence we have 



(def.ofm('+^') 
■u (def.ofrW) 

(by (8)) 

(def.of m(*)(Tj^)) 
(as f'{l)u = u) 



E rn('+i) 



m 



i.e., the sequence m'^^\m^^\ ... is a martingale. 

Define the stopping time Tk := mf{i > 1 | m^*^ G {0} U [fc, oo)}. Note that 
mi-'k) < fc + 2 as w < 1, and hence that m(^'=) e {0} U [k,k + 2]. We wish to 
apply Doob's Optional-Stopping Theorem [28] (sometimes called Optional-Sampling 
Theorem) to infer that E [to^'^'=)] = E [m^^^] = uxq ■ To this end we define the sequence 
m(^\ m(^\ ... by setting m^'^ := m^'^ for i < Tk and m('' := m^'^''^ for i > Tk. The 
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sequence m^^^ , m^^^ , ... is a martingale as m^^^ , m^^^ , ... is a martingale. To apply the 
Optional-Stopping Theorem we also need to make sure that |m'^*+^^ — m^*^ | is bounded 
by a constant, which is the case as m^*) G [0, A; + 2] for all i. Doob's Optional-Stopping 
Theorem now yields 



Recall that this is > 0. Since m^'"") e {0} U[k,k + 2], 



UXo=^ 



Ark) 



< 0-Pr 



which gives 



So we have 



Pr 



-(fc+2)-Pr 



i^^"^ > k 



m 



> 



k + 2 



(fc+2)-Pr 



i^^"^ > k 



PrfS"" > fc] = Pr 



sup|2(*)| > k 



> Pr 



supm 



«>fc 



> 



k + 2 



Hence, 



Et^-"] =^Pr[S"^ > A;] >^ 



fc=i 



fc=i 



A; -1-2 



= 00 



which completes the proof for the case where Xq is reachable from all types. 

Now we show that E[S''^] = oo also holds when Xq is not reachable from all types. 
Recall that p{f'{l)) = 1. It is a corollary (Corollary 2.1.6 of [5]) of Perron-Frobenius 
theory that /'(I) has a principal submatrix B which is irreducible and has spectral 
radius p{B) = 1. Let F' C F denote the set of types such that B is obtained from 
/'(I) by deleting all rows and columns not indexed by F'. Consider the task system 
A' which is the original task system restricted to F'. More concretely. A' has types F' 

and transition rules as follows: A rule X a' is in Zi' iff X e .T' and there is an 
a G Mp such that X ^ a is in the original task system and a' is obtained from a by 
deleting the types that are not in F'. Let g : denote the pgf for A'. From 

the construction of A' it is straightforward to see that B = g'{l). Pick an arbitrary 
X € F' as the initial type of A'. As B ^ is irreducible, X is reachable from all 
types in F'. Hence, the first part of the proof appUes and we obtain that, in Z\', we have 
E[S'^] = 00 for all online schedulers a. As A' was obtained by erasing types and rules 
from the original task system, it is easy to see that, also in the original task system, we 
have E[S'^] = oo for all onUne schedulers a. As X is reachable from Xq, it follows 
E[S"^] = 00 for all online schedulers a. □ 
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